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Abstract 

We present a general form of attribute exploration, a knowledge completion algo- 
rithm from Formal Concept Analysis. The aim of our presentation is not only to extend 
the applicability of attribute exploration by a general description. It may also allow to 
view different existing variants of attribute exploration as instances of a general form, 
which may simplify theoretical considerations. 

1 Introduction 

Attribute exploration is a well known algorithm within formal concept analysis |9]. Its 
main application can be summarized as semi-automatic knowledge base completion. Within 
this process, a domain expert is asked about the validity of certain implications in the 
domain of discourse. Based upon the answer of the domain expert, the algorithm enhances 
its knowledge until all implications are known to hold or not to hold in the domain, and 
the algorithm stops. 

Attribute exploration has gained much attention since its first formulation, and for cer- 
tain problems variations of attribute exploration have been devised where the original 
algorithm was not applicable. Those variations include attribute exploration on partial 
context |3| and exploration of models of the description logic SC |1 , 2|, among others. 

However, in almost all variations of attribute exploration that have been devised the 
overall structure of the algorithm remains the same. Furthermore, all important properties 
of attribute exploration remain, and one might be tempted to ask whether a general form 
of attribute exploration can be found that subsumes all many of these variations. The 
purpose of this work is to present some first considerations into this direction. 

We shall proceed as follows. After introducing the mandatory definitions in the first 
section we briefly revisit the classical description of attribute exploration as it is given in 1 9 1 . 
Starting from this, we motivate our generalizations and summarize the resulting algorithm 
together with its properties in the succeeding section. We shall have a close look at a spe- 
cial cases which involves pseudoclosed sets and results in some very nice results about the 



attribute exploration algorithm. Finally, we shall summarize our considerations and give 
an outlook on further questions. 



2 Preliminaries 

As attribute exploration is an algorithm from Formal Concept Analysis, we shall begin 
by introducing some basic definitions from within this field. This includes notions like 
formal contexts, contextual derivations, implications, partial contexts and pseudoclosed 
sets. We shall furthermore recall the notion of closure operators on sets, which we need 
for our considerations. 

Let G and M be two sets and let I C G x M. Then the triple K := (G, M, I) is called a 
formal context. We shall connect with it the following interpretation: The set G is the set of 
objects of K, M is the set of attributes of K and (g, m) is an element of the incidence relation 
I if and only if the object g has the attribute m. We may also write glm if (g, m) E I. If K 
is a formal context, then the set of objects, attributes and the incidence relation is denoted 
by Gk, Mr and Ik, respectively. 

Let us fix a formal context K — (G, M, I). If A C G, then the set of common attributes 
of A in K is denoted by 



denotes the set of all common objects of B in K. The sets A' and B 1 are called the (contex- 
tual) derivations of the respective sets, and the operators named (■)' are hence called the 
derivation operators of K. 

Lemma 1 LetK = (G, M,l) be a formal context and let A, A\,Ai C M, B, By B2 Q G. Then 
the following statements hold: 



i) A\ C A 2 A' 2 C A\ 

ii) B 1 CB 2 =^> B' 2 C B[ 
Hi) A C A" 

B C B" 

v ) A' = A'" 

vi) B' = B'" 

vii) ACB' A' DB 



A' := {m e M \ Vg E A : glm } 



and likewise for B C M, 



B' ■- {g e G I Vm 6 B : glm} 
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As we view the elements of G as objects with certain attributes from M, we may ask for 
two sets A,B C M whether all objects having all attributes from A also have all attributes 
from B. This can be rewritten in terms of the derivations operators as A' C B' . We shall 
call the pair (A,B) an implication on M and denote it as A — > B. If K is a formal context 
with attribute set M, then we may also say that A — > B is an implication of K. Then 
A is called the premise and B the conclusion of the implication. If indeed A' C £>', we 
shall call A — > B a valid implication of K, and we may write K |= (A — > B). As 
A' C B' ^=> B C A", we can observe that 

K\= (A — > B) 4=^ B C A". 

We shall denote with Imp(M) the set of all implications on M, with Imp(K) the set of all 
implications of K and with Th(]K) the set of all valid implications of K. 

Let £ C Imp(K) and let ACM. The set A is closed under £ if for all implications 
(X — > Y) e £ it holds that X % A or Y C A. Let us further define 

£°(A) := A, 

£ a (A) := [J{ Y | (X — > Y) e £,X C A }, 
C i (A):^C 1 (C i - 1 (A)) fori>l, 

and 

£(A) := |J C\A). 
ieN 

The set £(A) is then the smallest superset of A that is closed under £. 

The set Th(K) might be quite large, and to handle this set in practical applications it is 
desirable to represent it by a small subsets. To see how this is done let £ C Imp(K) and 
let (A — > B) E Imp(K). Then C entails A — > B, written as C \= (A — > B), if and 
only if B C C{ A). A set B C Imp(K) is called sound for C if every implication from B is 
entailed by C. B is said to be complete for C if every implication from £ is entailed by B. 
If B is both sound and complete for C, it is called a kzse for £. It is called a non-redundant 
base for £ if it is C -minimal with respect to this property. 

Let us denote with Cn(£) the set of all implications that are entailed by £. Then 

B is sound for £ <^=> B C Cn(£), 
B is complete for £ <^=> Cn(£>) D £. 

In particular, B is a base for £ if and only if Cn(Z3) = Cn(£). 

From all possible bases for £ one can explicitly describe a canonical base for £ which 
has the remarkable property that it has minimal cardinality among all bases for £. Let 
PCM. Then P is said to be pseudoclosed under £ if 

1. P ^ £(P) and 

2. for all pseudoclosed sets Q C P it follows £(Q) C P. 
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In particular, if £ = Th(K), then P is said to be a pseudointent o/K. Now the canonical base 
for £ is defined as 

Can(£) := { P — > £(P) | P pseudoclosed under £ }. 

Formal contexts require a certain kind of complete knowledge about their objects: If 
g € G and m E M then either g has the attribute m or not. Under certain circumstances 
this might be inappropriate, because it might not be known whether g has the attribute m, 
or it is simply irrelevant for the task at hand. Therefore we shall introduce partial contexts. 

Let M be a set. Then a partial context K is a set of pairs (A, B) with A, B C M such 
that A (~l B = 0. Such a pair is called a partial object description if A U B 7^ M and a/u/Z 
ofr/ecf description if A U B = M. Intuitively, one can understand partial objects descriptions 
as a pair of positive attributes, i. e. attributes the corresponding object definitively has, and 
negative attributes, i. e. attributes the corresponding object definitively does not have. The 
objects itself are not named in partial contexts. 

An implication for K is just an implication on M. Such an implication (A — > B) € 
Imp(M) is refuted by K if there exists a partial object description (X, Y) E K such that 
A C X, B n Y 7^ 0. If A C M, then the C -maximal set B such that A — > B is not refuted 
by K exists and is given by 

K(A) := B := M\(J{ Y I (X,Y) e K, A C X}. 

As it turns out, the operators (■)", £(•) and K(-) are instances of the more abstract 
notion of closure operators on sets. Let again M be a set. Then a function c: *P(M) — > 
*P(M) is said to be a closure operator on M if and only if 

i) c is extensive, i.e. A C c(A) for all ACM, 

ii) c is idempotent, i. e. c(c( A) ) — c(A) for all ACM, 

iii) c is monotone, i. e. A C B => c(A) C c(B) for all A, B C M. 

Both (■)" and £(•) are closure operators on their corresponding sets of attributes. A set 
A C Mis said to be closed under c if c(A) = A. The set of all closed sets of c, i. e. the image 
of c, is denoted by im c. A set P C M is said to be pseudoclosed under c if and only if 

i) P 7^ c(P) and 

ii) for all pseudoclosed Q C P, it holds that c(Q) C P. 

We shall write Cj (•) C C2(-) for two closure operators Ci, C2 on a set M if and only if 
Cl(A) C c 2 (A) for all ACM. 

3 Classical Attribute Exploration 

Given a finite set M, attribute exploration semi-automatically tries to determine the set of 
implications that are valid in a certain domain. Together with a set /C of already known 
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valid implications and a formal context K of valid examples, attribute exploration gener- 
ates implications A — > B that hold in K but are not entailed by /C. Those implications are 
asked to the expert for validity. If A — > B holds in the domain of discourse, it is added 
to the set K. Otherwise the expert has to present a counterexample for A — > B that is 
added to the formal context K. The procedure terminates if there are no such implications 
left. 

To describe attribute exploration more formally, let us define what is meant by a do- 
main expert. 

Definition 2 Let M be a set. A domain expert on Mis a function 

p: Imp(M) — > { T } U q?(M), 

where T is a special symbol not equal to any subset of M, such that the following condi- 
tions hold 

i) If X — > Y is an implication on M such that p(X — > Y) — C ^ T, then X C 

C, Y % C. (p gives counterexamples for false implications) 

ii) If A — > B and X — > Y are implications on M such that p(A — > B) — T and 
p(X — > Y) = C ^ T, then C is closed under { A — > B }, i. e. A % C or B C C. 
(counterexamples do not invalidate correct implications) 

If p(A — > B) = T, then we say that p confirms A — > B. Otherwise we say that p rejects 
the implication and we call the set C = p(A — > B) 7^ T a counterexample from p for 
A — > B. Finally, the theory of p is just the set of implications that p confirms, i. e. 

Th(p) := p-\{ T }) = {A — > B \ p(A — > B) — T}. 

An immediate consequence of the definition is the following observation. 

Lemma 3 Let Cbea set of implications such that a given domain expert p confirms every impli- 
cation in C If C \= ( A — > B), then p confirms A — > B as well. 

Proof Suppose that p(A — > B) = C ^ T. Then C is closed under £. This means that 
£(C) = C. Since C \= (A — > B), from A C C it follows that 

B C C{A) C £(C) = C, 

i. e. C is not a counterexample for A — > B, a contradiction. □ 

Before we are able to describe the attribute exploration algorithm more formally, we 
need to give another definition. 

Definition 4 Let M be a finite set and let < be a total order on M. Then for A, B C M and 
2 6 M we define 

A ~<i B :^=^ 2 = min<(AAB), 
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where AAB = (A \ B) U (B \ A) is the symmetric difference between A and B. If A -</ B, 
we say that A is lectically smaller than B at i. Furthermore, A is lectically smaller than B, 
written as A -< B, if there exists i 6 M such that A -</ B. Finally, 

A<B A = B or A ~< B. 

It is easy to see that H constitutes a linear ordering on *}3(M). 

We are now able to describe the process of attribute exploration in a more formal way. 

Algorithm 5 (Classical Attribute Exploration) Let M be a finite set, K be a formal context 
with attribute set M and let K C Imp(M) and let p be a domain expert on M. Suppose 
that JC C Th(p) C Th(K). 

i) Initialize P to the lectically first closed set of JC ( • ) . 

ii) If p" = p, then go to|V] Otherwise let r :— (P — > P"). 

iii) If p confirms r, then add r to JC. 

iv) If p gives a counterexample C for r, add a new object to K which has exactly the 
attributes in C. 

v) Let Q be the lectically next closed set after P of JC. If there is none left, terminate. 
Otherwise, set P to Q and go to IB] 

In any iteration, the current value of JC is called the set of currently known implications and 
the current value of K is called the current working context. <£> 

A first easy observation for this algorithm is the following: Suppose the expert p is 
called with an implication A — > B during the run of the algorithm. Let K, be the currently 
known implications at this time, and let likewise K denote the current working context. 
Then for each m € B both Th(p) \= (P — > {m})andTh(p) |£ (P — > {m}) is 
possible. In other words, the question whether Th(p) |= (P — > { m }) is not influenced 
by the values of JC and K but solely depends on how the expert p answers. Hence all 
questions to the expert can be seen as non-redundant. 

This property is very important especially in the presence of human experts which 
may not only be expensive to answer but might also get impatient when getting asked 
implications the algorithm could have inferred by itself. Therefore, this property should 
of course also hold for our generalized formulation of the attribute exploration, and it does, 
as we shall see. 

But before we do so, we shall mark down some of the major properties of this attribute 
exploration algorithm. 

Theorem 6 Let M be a finite set, < a total order on M, K a formal context with attribute set M, 
JC a set of implications on M and let p be a domain expert on M, such that p confirms K. and all 
implications confirmed by p hold in K, i. e. JC C Th(p) C Th(K). 

i) The attribute exploration algorithm terminates with K, JC and p as input. 
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ii) Let K,' and K' be the values corresponding to K, and K after the last iteration of the attribute 
exploration algorithm. Then K! is a base for Th(K'). 

Hi) Th(p) = Th(K') and the corresponding closure operator coincides with fC'(-). 

iv) The cardinality offC'\JC is the smallest possible. 

v) The premises in /C' \ K. are the /C-pseudoclosed of TTz(K'). Thereby, a set P C M is said 
to be IC-pseudoclosed under Lfor K,, C C Imp(M), if and only if 

i) P = /C(P), 

ii) P^C{P), 

Hi) for each IC-pseudoclosed set Q C P of C it holds that C(Q) C P. 

All but the last statement of the theorem are known from |9l[lTJ[6). The last statement has 
been mentioned partially in |11| and has been proven completely in |5|. 

4 Generalizing Attribute Exploration 

We shall now proceed by investigating the above description of attribute exploration for 
possible generalizations. While doing so, we shall not only generalize certain aspect of the 
algorithm but also generalize those aspects intuitively. The main aim of our generalization 
is to describe attribute exploration in more abstract terms, to allow applications of the 
algorithm beyond those of the classical algorithm. 

Let p be a domain expert on a set M. We start with an informal introduction of our 
generalizations, of which we shall name three: 

1. The use of the initial formal context K and the background knowledge /C can be re- 
duced to their corresponding closure operators ( • ) " and /C (• ) . The only major problem 
here is the handling of counterexamples, which we shall discuss latter in detail. Hence 
instead of passing the attribute exploration algorithm a formal context and some back- 
ground knowledge in the form of a set of valid implications, we instead provide two 
closure operators c un i v and c cer t on the set M. 

The closure operator takes the place of Th(K)(-) and represents the universal 
knowledge we already have about our domain of discourse. If A C M is a set of at- 
tributes, then c un i V (A) represents the attributes that can follow from A. Seen from 
another perspective, M \ c un i v (A) is the set of attributes that do not follow from A. 

In contrast to this, the closure operator c cer t represents the certain knowledge we al- 
ready have. In other words, c cei t(A) is the set of all attributes that definitively follow 
from A. This closure operators hence takes the place of the set K, of initially known 
implications. 

Clearly, we need to have c cer t ( ■ ) Q Th ( p ) ( • ) C c un j v ( ■ ) . 
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2. When providing counterexamples, we observe that we actually do not need to com- 
pletely specify them. It merely is sufficient to provide information on which attributes a 
certain object has and which it not, as long as this information contradicts a proposed 
implication. We shall take this approach and extend the algorithm to store those coun- 
terexamples in a partial context. This idea has also been discussed in |3l[8l. 

3. The implications which are proposed to the expert are of a very special form, which 
guarantees certain optimality statements about the algorithm. However, for the main 
application of knowledge acquisition and knowledge completion, this rather special 
form can be viewed as a certain kind of optimization. To drop it, we may rather say that 
in any iteration step of the attribute exploration algorithm, we search for an undecided 
implication with respect to the current values of c cer t and c un j v , i. e. an implication A — > 
B on M such that c cer t(A) £ B C c un j v (A) and where both A and B are finite. For 
such an implication we cannot infer from c cer t and c un j v whether attributes ( A ) \ B 
follow from A or not, and hence we have to ask the domain expert. 

We shall take these observations as guidelines for our further considerations. We start by 
generalizing our notion of a domain expert such that we allow partial counter examples. 
Next we present and discuss our general form of attribute exploration that incorporates 
the above mentioned ideas. For this we shall also prove correctness and non-redundancy 
of the questions asked to the expert. Subsequently, we shall have a closer look on how to 
compute undecided implications in our general setting as it is done in the classical case. 

Definition 7 Let M be a set. A function q: Imp(M) — > { T } U <P(M) 2 is said to be a 
partial domain expert on M if and only if T is an element not in ^P(M) and the following 
conditions hold: 

1. If for (A — > B) £ Imp(M)itholdsthat<7(A — > B) = (C,D) ^ T,thenCnD = 0, 
A C C and B PI D 7^ 0. (q gives sufficient counterexamples for false implications) 

2. If (A — > B), (X — > Y) £ Imp(M) are such that q(A — > B) = T and q(X — > 
Y) — (C, D) T, then if A C C then B H D = 0. (counterexamples do not refute correct 
implications) 

As in the case for domain experts, we say that q confirms an implication A — > B if and 
only if q(A — > B) = T. Otherwise we say that q rejects the implication and we call 
q(A — > B) T a counterexample from q for A — > B. Th(q) shall denote the set of all 
implications on M that are confirmed by q. () 

The counterexamples given by a partial domain expert can be seen as partial object de- 
scriptions that are enough to invalidate a given implication. 

Let us first investigate immediate consequences from the definition. One of those is the 
fact, as one would expect, that Th(^) is closed under entailment, i. e. Cn(Th(q 1 ) ) = Th(^). 

Lemma 8 Let L C Imp(M) for a set M and let q be a partial domain expert on M, such that 
q confirms all implications in C If C \= (A — > B) for some (A — > B) £ Imp(M), then q 
confirms A — > B as well. 
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Proof Suppose that q(A — > B) = (C, D) is a counterexample from q for A — > B. Then 
A C C. Now £(C) C M \ D by the second condition on partial domain experts. Since 
£ |= (A — > B), it follows that B C £(A) C £(C) C M \ D. Therefore, B n D = 0, 
contradicting the fact that (C, D) is a counterexample for A — > B from rj. □ 

Lemma 9 If(C,D)isa counterexample given by a partial domain expert q on M, then Th(q) (C) D 
D = 0. 

Proof By Lemma|8l confirms C — S- Th{q){C). Therefore, by the second condition in the 
definition of t/, it follows D n Th(^) (C) = 0, as required. □ 

Lemma 10 For a partial context K wrtfo attribute set M and a partial domain expert q on M it 
holds thatTh{q)(-) C K(-) ifandonlyifTh(q)(C) C M\D for each (C,D) G K. 

Proo/ Th(^) (■) C K(-) implies Th(^) (C) n D = for each (C, D) G K, which is equiv- 
alent to Th(q) (C) C M \ D. 

For the converse let Th(^) (C) n D = for all (C, D) e K. Let ACM. Then for every 
(C, D) G K with A C C, it follows that Th(^) (A) n D C Th(^) (C) n D = 0. Therefore 

Th(^)(^) n(J{ D I (C, D) e K, A C C } = 

and hence Th(q) (A) C K(A) as required. □ 

With those observations at hand, we are now able to state our generalized formulation 
of the attribute exploration algorithm. 

Algorithm 11 (General Attribute Exploration) Let M be a set, c ce rt/ c U m V closure opera- 
tors on M and q a partial domain expert M, such that c cer t(') Q Th(^) (■) C c un j v (-). 

i. Let K = 0. 

ii. Let A C M be finite and such that there exists a finite set B C M with c cer t(A) C 
B C c un i v (A). If there is no such set, terminate with output K and c cert . Otherwise 
consider the implication A — > B. 

iii. If q confirms A — > B, then update c cer t to be the closure operators whose closed sets 
are exactly the closed sets of c cer t that are also closed under { A — > B } . 

iv. Otherwise let (C, D) = q(A — > B) be a counterexample from q for A — ► B. Add 
(C,D)toK 

v. Replace all counterexamples (C, D) G K by (C', D'), where 

C := c cer t(C), 

D':=DU{meM\D c C ert(C U{m})nD/0}. 
vi. Update Cuniv to be the closure operator given by 

x^c» v (x)ni(x) 

for all XCM. 
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vii. Go to OH 



Starting from this reformulation of the attribute exploration algorithm we shall now 
consider the properties this algorithm has. We shall show in this section that the algorithm, 
as in the classical case, does not ask question its answers it could infer itself. Furthermore, 
the algorithm is correct in the sense that it returns a complete description of the domain 
the given partial domain expert represents. Termination, however, cannot be shown in 
general, and we shall only give some sufficient condition. 

The results in the minimality of the resulting set of confirmed implications does not 
hold in this general setting. For this, we have to generate the implications asked to the 
expert in a way similar to the classical case. We shall discuss this in more detail in the next 
section. 

To discuss the properties of Aleorithmllll we need the following result. 

Lemma 12 At the end of every iteration of the generalized attribute exploration algorithm it holds 
that c cer t(-) C Th(^)(-) C c un j V (-) for the current values o/c cer t and c un i v . In particular, 
Ccert(X) C K(X) holds for all X C M at the end of every iteration. 

Proof We prove the claim by induction. For the base case we observe that K = and 
therefore K(X) = M for all X C M. Furthermore c cert (■ ) C Th(^)(-) C c univ (-)bythe 
prerequisites of the algorithm. 

For the induction step assume that c cer t (• ) C Th(q)(-) C c un i v (■) holds at the begin- 
ning of the current iteration. Assume A, B C M finite such that c cer t( A) C B C c un i v (A), 
for otherwise nothing has to be shown. We now distinguish two cases: 

i. cj confirms A — > B. Then c cer t is updated to the value of 

c cert = X 1 > Ccert(£(c C ert(X))) 

where £ = { A — > B } and X C M. Since q confirms A — > B and c cert (-) C 
Th{q)(-), it follows that c' cen {-) C Th{q){-). 

In the situation before steplVJ by Lemma|9]for every element (C, D) 6 K it holds that 
Th((/)(C) n D = and hence c' cert {C) n D = 0. Moreover, C := c' cert (C) is also 
disjoint to 

D':=DU{meM\D | c' ceit (C U{m})nD^0} 

and (C' — > {m}) <£ Th(q) for m 6 D'\D. Therefore, after stepE Th(q)(C') C 
M \ D' for every (C, D') G K. Then by Lemma[10l Th(q) (•) C K(-) and therefore 
Ccert(') S Th(^)(-) C c univ (-) DK(-) as required. 

ii. gives (X, Y) as a counterexample for A — > B. Then in this iteration the value of 
c C ert is not changed. The counterexample that is effectively added to K is then 

(X',Y ; ) = (c C e r t(X), YU { m e M\Y \ c cert (X U { m }) n Y 7^ }). 

Since Th(^)(X') C M \ Y', from Lemma ITOl and the induction hypothesis it fol- 
lows that Th(<7) (•) C K(-). Together with Th(^)(-) C c un i v (■) we obtain c cer t (■ ) Q 
Th(q)(-) C Cuniv (•) nK(-) as required. □ 
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We shall at first investigate the already mentioned property that questions asked to the 
expert are somehow non-redundant. We state this kind of non-redundancy as the fact that 
the answer to a proposed implication is not predetermined by the current knowledge or 
by the answers given so far. 

Theorem 13 Let M be a set, c ce rt, Cuniv closure operators on M and q a partial domain expert 
on M such that c cer t(-) C Th(q)(-) C c un j v (-). Suppose that we are in the n + 1 iteration of 
Algorithm\l]]and suppose that the implication A — > B is asked to the expert q. 

Then for each m £ B there exist two partial domain experts q\, qi which return the same values 
as q in all iterations i £ { 1, . . . , n } and satisfy c cer t ( ■ ) C Th(qi)(-),Th(q2)(-) Q Cuniv( - )' 
such that q\ rejects A — > { m } and qi confirms A — > { m }. 

Proof Let c' ceTt , c 1 - , K 1 be the values of the corresponding closure operators and the cur- 
rent working context in iteration i £ { 1, . . . , n }, respectively. Furthermore, let A, — > B,- 
be the implication asked in iteration i. Finally, let T be a symbol not equal to any subset of 
M. 

We then define q\ as follows: 

!q(A — > B) if (A — > B) = (A; — > B,) for some i, 

T ifBCc c " ert (A), 
( c C ert ( A) , M \ c C ert ( A ) ) otherwise, 

for all (A — > B) £ Imp(M). Then q\ is a partial domain expert on M and Th(^i) = 
7h(c" eTt ). Since c" ert (-) C c„„,„ ( ■ ) by Lemmall2land m ^ Cc ert (A), rejects A — > {m}. 

To construct qi we consider the formal context K with object set K", attribute set M 
and incidence relation 1^ given by 



{C,D)I K x 



x £ c c ! ert (CU{>}) iim^D 
x £ C otherwise 



for all (C, D) £ K" and x £ M. By step |v] in Algorithm ITT1 all object intents of K are 
closed under Ccert/ therefore Th(c" ert ) C Th(K). Together with c ce rt (• ) Q c" ert (-) follows 

Ccert (•) CTh(K)(-). 

We now define qi by 

{q(A — > B) if (A — > B) = (Ai — > B,-) for some z, 

T ifBCA"nc^ iv (A), 

(X, M \ X) with X = A" n c^ niv (A) otherwise 

for all (A — > B) £ Imp(M). Thentj2 is a partial domain expert with Th( ^2) = Th(K) fl 
Th(c^ iv ). For this we observe that for (C, D) £ K",if m £ D, then c" ert (C U { m }) n D = 
by steplv) Therefore, the counterexamples given for some implication A,- — > B ( - from q 
can also be given by q^. 

Sinceccert(-) C Th(K)(-)andc ce rt(-) Q c n univ (-), it follows that c ce rt(-) C Th(^ 2 )(-) C 

Cuniv ( ' ) • 
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Furthermore, m 6 c^ niv (A) and since K" does not reject A — > B, it follows that for 
each (C, D) e K" with A C C that m ^ D. Hence, m 6 A" and therefore ^2 confirms 
A — >■ B as required. □ 

One of the crucial features of attribute exploration is that it returns a complete descrip- 
tion of the domain of discourse upon termination. This property does also hold for our 
generalized formulation. 

Theorem 14 Let M be a set, c cer t, c U niv closure operators on M and let q be a partial domain 
expert on M. Furthermore, suppose that c cer t(-) Q Th(q)(-) C c un j v (-). 

Suppose that Algorithm [IT] terminates on input c cer t, c un i v and q and denote the returned 
partial context by K and the returned closure operator by c. Let XCM such that c(X) is finite. 

i. Th( <? )(X) = c(X). 

a. c(x) = Cuniv (x)nK(x). 

Hi. Let K, be the set of all implications which have been confirmed by q during the run of the 
algorithm. Define c'(X) to be the smallest set that contains X and is closed under both c cer t 
andJC(-). Then c'(X) = c(X). 

iv. Let K = (K, M, I) where 

(C,D)Im -<=^> meC. 

Then 

c(x) = c univ (x)nx", 

rofcere (■)" denotes the double derivation operator in K. 

Proof By Lemma [12l c( Ert (-) C Th(^)(-) C c' ■ (•) holds at the end of every iteration in 
the run of the algorithm, where c' celt and cj m j v denote the current values of the correspond- 
ing closure operators. Since the algorithm terminates, c' celt (Y) = c( iniv (Y) holds in the last 
iteration for all Y C M if c' cert (Y) is finite. Since c = c' cert and c^, ert (X) C Th(^)(X) C 
c(j niv (X), the first assertion follows. 

By induction on the number of iterations of the algorithm, one can see that at the end 
of every iteration of the algorithm it holds that (X) = (X) nK(X), where 
is the current value of the upper closure operator, c un i V is the original value of the upper 
closure operator and K is the current working context. Since the algorithm terminates, 
c' i v (X) = c(X) holds in the last iteration and the second claim follows. 

Suppose that the algorithm is in a certain iteration and suppose that JC is the set of 
confirmed implications up to now. By induction we see that if c' celt is the current value of 
the lower closure operator, then c' celt (X) is the smallest set containing X that is closed both 
under c cer t and JC (•). As c is the last value of the lower closure operator during the run 
of the algorithm, c(X) = c'(X), which shows the third claim. 
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For the last claim we observe the following relations: 

x" = n c 

(C,D)GK,XCC 

C fl M\D 

(C,D)eK,XCC 

= K(X). 

By step|V]of the algorithm, C is closed under c for every (C, D) £ K. Therefore, c(X) C 
X". Together this yields 

cuniv(x) n C (x) c Cuniv (x) n x" c c univ (x) n k(x) 

and since c(X) C c un i v (X) and c(X) = c un j v (X) PI K(X), the last claim follows. □ 

Termination of the generalized attribute exploration algorithm is not guaranteed in 
general (i. e. when M is infinite and c cer t and c un j v are arbitrary). Hence, termination nor- 
mally has to be shown for the concrete application at hand. We can, however, give some 
sufficient condition which may still be helpful. 

Theorem 15 The general attribute exploration algorithm with input c cer t, c un i v and a partial 
domain expert q terminates if there are only finitely many closure operators c on M such that 
Ccert(') £ c(-) C CunivO). 

Proof The claim follows easily if we can show that in every iteration of attribute exploration 
either the value of c cert is updated to a new value c' cert such that c cert C c' cert C or, 
likewise, if the value for is updated to a new value c' univ such that c cer t C C 

Let A be such that c ce rt (A) ^ c univ (A) and let B C M be finite such that c cert (A) C 
c univ (A). If (J confirms A — > B, then c cer t is updated to the value 

L cert 

(X) = c cert (£(c cert (X))), 
where £ = {A — > B } and X C M. Clearly, c cer t(-) £ 4 ert (•) and by Lemma [T2] 

c cert(') — c univ(')- 

If q yields a counterexample (C, D) for A — ► B, then the new value c univ for Cuniv is 
computed by 

C(x) = c m (x)nK(x) 

for X C M. It follows that c' niv (-) C c univ (-) and c univ (A) C c univ (A) \ D C ^^(A), 
sinceC C A,B C c univ (A) and BflD ^ 0. By Lemma[12]it follows that c cert (X) C K(X) 
for all X C M. Hence c cert (-) C c univ (-) C c muv (-) as required. □ 

Of course, if after finitely many iterations the situation of the theorem is reached, the 
generalized attribute exploration will terminate as well. 
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5 Computing Undecided Implications 



We have seen that a lot of the useful properties of attribute exploration remain true in our 
generalized form of Algorithm [TT] However, we have not discussed the property of the 
classical attribute exploration that the number of questions which the expert confirms is 
minimal. Indeed, we cannot expect that from our generalization, as we have not opposed 
any restriction on the order in which implications are asked. It is therefore possible to ask 

an implication A > B, which is confirmed, just to ask in the next iteration an implication 

A — > C with C D B, which might also get confirmed. It is therefore advisable to always 
ask implications with C -maximal conclusions. However, even in that case it might not be 
clear whether the number of confirmed implications asked is really minimal. 

We therefore want to discuss in this section whether it is possible to modify our gen- 
eral attribute exploration such that the number of questions asked such that the expert 
confirms is the smallest possible. For this we shall try to adapt the computation of unde- 
cided implications from the classical case. 

Let us recall how implications asked to a domain expert p are computed in the case of 
classical attribute exploration, as discussed in Algorithm [5] For this suppose that we are 
in a certain iteration of the algorithm, with known implications /C, working context K and 
P the last computed premise. Further suppose that we have fixed a total order on the set 
M before the start of the algorithm, which induces a lectic order ^ on Cp(M). Then, in the 
classical case, we compute the lectically smallest set Q C M after P that is closed under K, 
and that is not an intent of K. The implication Q — > Q" is then asked to p. 

Computing the lectically next set after a set P can be done using the Next-Closure 
algorithm \ 7\. However, for theoretical considerations we can neglect lectic orderings, as 
we shall see in a moment. 

Let M be a finite set. To guarantee that the number of confirmed implications is as 
small as possible, we change steplHlto: 

ii'. Let A C M be such that A = c cer t(A) C c ur ,j v (A) and A is C -minimal with respect 
to this property. Consider the implication A — > c ur d v (A). 

This is a generalization of the corresponding step in the classical case. If P is the premise 
of the last implication asked, then the lectically next set Q after P is a C -minimal set with 
Q = /C(Q) C Q", and the implication Q — > Q" is asked next. 

Before we give the formal statement of the fact that this indeed yields an algorithm 
that always asks a minimal number of confirmed implications, we shall give the following 
definition. 

Definition 16 Let c\, Ci be two closure operators on a finite set M and let P C M. Then P 
is said to be c\-pseudoclosed under Ci if and only if 

i. c 1 (P) = P, 

ii. c 2 (P)^P, 

iii. for all Q C P being Ci-pseudoclosed under c 2 it follows that C2(Q) C P. 



14 



Theorem 17 Consider Algorithm\T]]with step\U\replaced by step\u\. 

Let M be a finite set, q a partial domain expert on M, c cer t, c un i v closure operators on M such 
that c C ert (•) C Th(q)(-) C c unrv (-). Let K. be the set of confirmed implications during the run 
of the algorithm with input c cer t, Cuniv an & <?/ an d 1st c be the returned closure operator. 

Then the premises of the implications in K, are exactly the c cei t~pseudoclosed sets of c. 

Proof We show that a set A C M is a c cer t-pseudoclosed set of c if and only if the impli- 
cation A — > c(A) is asked to and confirmed by q. We shall do so using well-founded 
induction, which is possible since M is finite. 

Let A be a premise of a confirmed implication A — > B. It follows that B — c univ (A) 
for the corresponding value of c[ lniv in the iteration in which A — > B is asked to q. Then 
A is closed under c cer t and under all currently known implications, i. e. under 

{X — >Y\(X — > Y) 6 K, X C A }. 

Suppose that their exists an implication (X — > Y) E K such that X C B. Then 
Y C c' imiv (B) — c( miv (A) = B. Therefore, B is closed under K, and hence B = c(A). 

We shall show next that A is a c cer t-pseudoclosed set of c. We already know that A 
is closed under c cer t. Furthermore, since A — > B is asked to q, B 7^ A and therefore 
A^c(A). 

Let RCA be a c C ert-pseudoclosed set of c. By the induction hypothesis, R — > c(R) 
is asked to and confirmed by q. Since A is closed under all those implications, it follows 
that c(R) C A as required. 

Conversely, let A be a c cer t-pseudoclosed set of c. By the induction hypothesis, for all 
Ccert-pseudoclosed sets RCA the implication R — > c(R) is asked to and confirmed by 
q. Since c(R) C A and c cer t(A) = A it follows that A is C-minimal with respect to being 
closed under c ce rt arid all confirmed implications X — > Y with X C A. Therefore, A — > 
c univ (A) will be asked in a certain iteration, with the corresponding value of cC^ v . Since 
c(A) C c( miv (A) and A 7^ c(A), after a finite number of counterexamples A — > c(A) 
will be asked to and confirmed by q. □ 

Recall the fact that the set 

K. := { P — > c(P) I P is Ccert-pseudoclosed set of c } 

has minimal cardinality such that every set A C Mis closed under c if and only if A is 
closed under c ce rt and K.. This has been proven in |5[ for the case of c cer t — K(-) for a set 
K, C Imp(M) and c = (■)" for some given formal context K with K |= K,. However, the 
proof given there also holds in our general setting. 
Summing up, we obtain our desired result. 

Corollary 18 The number of confirmed implications during the run of the general attribute explo- 
ration algorithm is as small as possible. 



15 



6 Conclusions 



Starting from a classical formulation of attribute exploration using domain experts, we 
have presented a more general formulation of attribute exploration that is able to work 
with abstractly given closure operators and can handle partially given counterexamples. 
We have also seen that most of the properties of classical attribute exploration remain in 
general or, as in the case of minimality of confirmed implications, under certain restric- 
tions. 
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